The Data Science Design Manual

The Data Science Design Manual

  • Downloads:2467
  • Type:Epub+TxT+PDF+Mobi
  • Create Date:2021-07-02 09:54:09
  • Update Date:2025-09-06
  • Status:finish
  • Author:Steven S. Skiena
  • ISBN:3319554433
  • Environment:PC/Android/iPhone/iPad/Kindle

Summary

This book serves an introduction to data science, focusing on the skills and principles needed to build systems for collecting, analyzing, and interpreting data。 As a discipline, data science sits at the intersection of statistics, computer science, and machine learning, but it is building a distinct heft and character of its own。

In particular, the book stresses the following basic principles as fundamental to becoming a good data scientist: "Valuing Doing the Simple Things Right," laying the groundwork of what really matters in analyzing data; "Developing Mathematical Intuition," so that readers can understand on an intuitive level why these concepts were developed, how they are useful and when they work best, and; "Thinking Like a Computer Scientist, but Acting Like a Statistician," following approaches which come most naturally to computer scientists while maintaining the core values of statistical reasoning。 The book does not emphasize any particular language or suite of data analysis tools, but instead provides a high-level discussion of important design principles。

This book covers enough material for an "Introduction to Data Science" course at the undergraduate or early graduate student levels。 A full set of lecture slides for teaching this course are available at an associated website, along with data resources for projects and assignments, and online video lectures。

Other Pedagogical features of this book include: "War Stories" offering perspectives on how data science techniques apply in the real world; "False Starts" revealing the subtle reasons why certain approaches fail; "Take-Home Lessons" emphasizing the big-picture concepts to learn from each chapter; "Homework Problems" providing a wide range of exercises for self-study; "Kaggle Challenges" from the online platform Kaggle; examples taken from the data science television show "The Quant Shop," and; concluding notes in each tutorial chapter pointing readers to primary sources and additional references。

Download

Reviews

high g individual

I did not expect to meet fiction style in a manual。 Cost of resources spent on filtering the emotions of the author and the details of his personal life are not compatible with the value of the knowledge he has provided。I want to be paid for this comment。 Since my time is wasted。

Wladston Filho

Incredibly well written and complete book。 It’s my go-to book to recommend to technical people wanting to dig deeper into data science。

Sebastian

One of the best resources at the time to go into the matter of Data Science。 Highly recommended, as any of the S。 Skienna books (which he also recommends reading for a better understanding of some topics)。

Natu Lauchande

Great reference book 。 Mostly from a CS perspective with lot's of intuition 。 Currently a reference book for all my ML/DS related projects。 Great reference book 。 Mostly from a CS perspective with lot's of intuition 。 Currently a reference book for all my ML/DS related projects。 。。。more

Paweł Kacprzak

Nice war stories and a great chapter about visualizations - this is what is hard to find in other books, and I guess it might be a new read for many computer scientists/programmers。 I also appreciate many practical examples。 A few other chapters are more or less standard and some of them, for instance, the one about distributed computing, feel like thrown there just to fill the content。 Nevertheless, I recommend reading it。

Mkfs

A good introductory book to statistical analysis data mining data science。 This is clearly aimed at students - the Coda at its conclusion exhorts the reader to now get a data science job (no thanks, got a real job already), and there is an expectation in the word-frequency discussion that the reader has never encountered the word defenestrate (ha! just last week I had to defenstrate an intruder!)。It's always good to get Skiena's take on things -- I've read three or four of his books now -- and t A good introductory book to statistical analysis data mining data science。 This is clearly aimed at students - the Coda at its conclusion exhorts the reader to now get a data science job (no thanks, got a real job already), and there is an expectation in the word-frequency discussion that the reader has never encountered the word defenestrate (ha! just last week I had to defenstrate an intruder!)。It's always good to get Skiena's take on things -- I've read three or four of his books now -- and this one is no exception。 The statistical-learner stuff is linked more closely to standard CS topics (e。g。 algorithmic complexity) than in most other texts, and the overview of linear algebra is really quite good。The only real downside is that it doesn't do what is says on the tin。 Unlike The Algorithm Design Manual, this isn't presented as a taxonomy of data science methods with a briefing of when and how each should be supplied。 More's the pity, as that particular book is sorely needed - even in this one, Skiena points out that most researchers become comfortable with one approach and use it for everything, rather than testing alternate approaches on new problems。 Instead, it's a standard Introduction to Data Science textbook with chapters devoted to topics of increasing complexity/sophistication。 Well-written, often entertaining, with an excellent selection of exercises (including many Kaggle challenges and some publicly-available datasets - precisely the sort of project that a beginner needs to get their feet wet)。 。。。more